Working Group: Classification, Representation and Modeling

نویسندگان

  • Anish Das Sarma
  • Ander de Keijzer
  • Amol Deshpande
  • Peter J. Haas
  • Ihab F. Ilyas
  • Christoph Koch
  • Thomas Neumann
  • Dan Olteanu
  • Martin Theobald
  • Vasilis Vassalos
چکیده

This report briefly summarizes the discussions carried out in the working group on classification, representation and modeling of uncertain data. The discussion was divided into two subgroups: the first subgroup studied how different representation and modeling alternatives currently proposed can fit in a bigger picture of theory and technology interaction, while the second subgroup focused on contrasting current system implementations and the reasons behind such diverse class of available prototypes. We summarize the findings of these two groups and the future steps suggested by group members. 1 Theory and Technologies The Big Picture Uncertainty modeling, viewed as an interaction between theory and technologies, is a space of multiple dimensions that cover concepts, theory platforms, technology platforms and application features. More specifically: A Theory Matrix that is the product of a vector of uncertainty concepts and a vector of possible theory platforms represents multiple uncertainty formalisms. Example uncertainty concepts include tuple uncertainty (where the existence of a database record is an uncertain event), and value uncertainty (where values of some of the attributes/features of some entity are uncertain or unknown). Example theory platforms include probability theory and fuzzy logic. The interaction between possible uncertainty formalisms and current technologies (e.g., relational and XML data engines) has produced diverse uncertaintyaware technologies currently proposed by different research groups. For example, introducing uncertainty formalisms based on tuple and value uncertainty and probability theory in relational platform produced probabilistic database engines such as ORION [1] and TRIO [2–4]. Finally, the features implemented Dagstuhl Seminar Proceedings 08421 Uncertainty Management in Information Systems http://drops.dagstuhl.de/opus/volltexte/2009/1941

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Joint Semantic Vector Representation Model for Text Clustering and Classification

Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...

متن کامل

A New Document Embedding Method for News Classification

Abstract- Text classification is one of the main tasks of natural language processing (NLP). In this task, documents are classified into pre-defined categories. There is lots of news spreading on the web. A text classifier can categorize news automatically and this facilitates and accelerates access to the news. The first step in text classification is to represent documents in a suitable way t...

متن کامل

A Simplified Working Classification Proposed for Myxoid Tumors of Oral Cavity

Dear Editor-in-Chief Myxoid tumors of oral cavity encompass a broad spectrum of lesions, primarily neoplastic.(1) The significant variations in the biological behaviour; ranging from completely harmless to malignant neoplasm they require an accurate histopathological diagnosis to ensure a proper treatment. (2) Because of the considerable overlapping between the clinical and histopathologi...

متن کامل

Deep Unsupervised Domain Adaptation for Image Classification via Low Rank Representation Learning

Domain adaptation is a powerful technique given a wide amount of labeled data from similar attributes in different domains. In real-world applications, there is a huge number of data but almost more of them are unlabeled. It is effective in image classification where it is expensive and time-consuming to obtain adequate label data. We propose a novel method named DALRRL, which consists of deep ...

متن کامل

Image Classification via Sparse Representation and Subspace Alignment

Image representation is a crucial problem in image processing where there exist many low-level representations of image, i.e., SIFT, HOG and so on. But there is a missing link across low-level and high-level semantic representations. In fact, traditional machine learning approaches, e.g., non-negative matrix factorization, sparse representation and principle component analysis are employed to d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009